<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html
     PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
     "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
<head>
<title>LingPipe Home</title>
<meta http-equiv="Content-type"
      content="application/xhtml+xml; charset=utf-8"/>
<meta http-equiv="Content-Language"
      content="en"/>
<link href="web/css/lp-site.css"
      type="text/css"
      rel="stylesheet"
      title="lp-site"
      media="screen,projection,tv"/>
<link href="web/css/lp-site-print.css"
      title="lp-site"
      type="text/css"
      rel="stylesheet"
      media="print,handheld,tty,aural,braille,embossed"/>
</head>

<body>

<div id="header">
<h1 id="product">LingPipe</h1><h1 id="pagetitle">Home</h1>
<a id="logo"
   href="http://alias-i.com/"
  ><img src="web/img/logo-small.gif" alt="alias-i logo"/>
</a>
</div><!-- head -->


<div id="navig">

<!-- set class="current" for current link -->
<ul>
<li><a class="current" href="index.html">home</a></li>

<li><a href="web/demos.html">demos</a></li>

<li><a href="web/licensing.html">license</a></li>

<li>download
<ul>
<li><a href="web/download.html">lingpipe core</a></li>
<li><a href="web/models.html">models</a></li>
</ul>
</li>

<li>docs
<ul>
<li><a href="web/install.html">install</a></li>
<li><a href="demos/tutorial/read-me.html">tutorials</a></li>
<li><a href="docs/api/index.html">javadoc</a></li>
<li><a href="web/book.html">textbook</a></li>
</ul>
</li>

<li>community
<ul>
<li><a href="web/customers.html">customers</a></li>
<li><a href="http://groups.yahoo.com/group/LingPipe/">newsgroup</a></li>
<li><a href="http://lingpipe-blog.com/">blog</a></li>
<li><a href="web/bugs.html">bugs</a></li>
<li><a href="web/sandbox.html">sandbox</a></li>
<li><a href="web/competition.html">competition</a></li>
<li><a href="web/citations.html">citations</a></li>
</ul>
</li>

<li><a href="web/contact.html">contact</a></li>

<li><a href="web/about.html">about alias-i</a></li>
</ul>

<div class="search">
<form action="http://www.google.com/search">
<p>
<input type="hidden" name="hl" value="en" />
<input type="hidden" name="ie" value="UTF-8" />
<input type="hidden" name="oe" value="UTF-8" />
<input type="hidden" name="sitesearch" value="alias-i.com" />
<input class="query" size="10%" name="q" value="" />
<br />
<input class="submit" type="submit" value="Search" name="submit" />
<span style="font-size:.6em; color:#888">by&nbsp;Google</span>
</p>
</form>
</div>

</div><!-- navig -->

<!--
<div id="contentnav">

<div id="breadcrumbs">
lingpipe
>
<a href="index.html" class="current">home</a>
</div>

<div id="contentmenu">
<a href="#whatslp">what is it?</a><a href="#whatsnew">news</a></div>

</div>
-->

<div id="content" class="content">

<h2>How Can We Help You?</h2>


<ul class="big">
<li>Get the latest version: <a href="web/download.html">Free and Paid Licenses/Downloads</a></li>
<li>Learn how to use LingPipe: <a href="demos/tutorial/read-me.html">Tutorials</a> </li>
<li>Get expert help using LingPipe: <a href="web/services.html">Services</a> </li>
</ul>

<a href="http://www.facebook.com/widgets/like.php?href=http://www.facebook.com/profile.php?id=100001847292531#!/pages/LingPipe/166735896694571?v=info">Join us on Facebook</a>





<h2>What is LingPipe?</h2>
<p>
LingPipe is tool kit for processing text using computational linguistics. LingPipe is used
to do tasks like:
</p>
<ul>
<li>Find the names of people, organizations or locations in news</li>
<li>Automatically classify Twitter search results into categories</li>
<li>Suggest correct spellings of queries</li>
</ul>
<p>To get a better idea of the range of possible LingPipe uses, visit our <a href="demos/tutorial/read-me.html">tutorials</a> and <a href="web/sandbox.html">sandbox</a>.
</p>


<h3>Architecture</h3>
<p>
LingPipe's architecture is designed to be efficient, scalable, reusable,
and robust.  Highlights include:
</p>

<ul>
<li>
Java API with source code and unit tests;
</li>

<li>
multi-lingual, multi-domain, multi-genre models;
</li>

<li>
training with new data for new tasks;
</li>

<li>
n-best output with statistical confidence estimates;
</li>

<li>
online training (learn-a-little, tag-a-little);
</li>

<li>
thread-safe models and decoders for concurrent-read exclusive-write
(CREW) synchronization; and
</li>

<li>
character encoding-sensitive I/O.
</li>

</ul>




<h2>Latest Release: LingPipe&nbsp;4.1.0</h2>

<div class="sidebar">
<h2>Download Latest: LingPipe&nbsp;4.1.0</h2>
<ul>
<li>
<a href="web/download.html">Download page for LingPipe&nbsp;4.1.0</a>
</li>
</ul>
<p>
Entire distribution contains the precompiled jar, javadoc, source, tests,
libs, tutorials and demos.  Just add Java.
</p>
</div>


<div class="sidebar">
<h2>Download Last 3.9.x Version</h2>
<ul>
<li>
<a href="../lingpipe-3.9.3">LingPipe&nbsp;3.9.3 Home Page</a>
</li>
</ul>
<p>
This is a link to another top-level page like this one, which
itself contains download links.
</p>
</div>


<h3>Intermediate Release</h3>

<p>The latest release of LingPipe is LingPipe&nbsp;4.1.0, which is a
feature release, as well as patching some bugs.  It is fully
backward compatible with LingPipe version 4.0.1.
</p>

<h4>Character, Token, and Document Suffix Arrays</h4>

<p>The largest addition in LingPipe 4.1 is suffix arrays.
The package <code>com.aliasi.suffixarray</code> contains
classes for suffix arrays of characters, of tokens, or
of tokenized documents with links back to the documents
from the suffix array.  Suffix arrays support finding
arbitrary length repeated strings in a large text
collection. </p>

<h4>Serialization for Language Models</h4>

<p>We also added serializability to a number of the language
model implementations which helps them play nicely with our
classifiers, taggers, etc.</p>

<h4>TF/IDF Classifier Access Methods</h4>

<p>We added methods to TF/IDF classifiers to access the
raw IDF values for terms and raw IDF values for term/document pairs.
</p>

<h4>Line Tagging Parser</h4>

<p>The line tagging parser was updated to handle more general
end-of-line markers across platforms.</p>


<h4>Single-Link Clustering Bug</h4>

<p>We fixed a bug in single-link clustering which caused elements
further away than the distance bound from all other elements to
disappear.</p>

<h4>Tests Fork</h4>

<p>If you run our top-level API test through Ant, you'll find they're
much slower, as in about four times slower.  This isn't because
LingPipe is slower, but because we rewrote the test call to fork a new
process for each test.  This allows the tests to succeed out of the
box with under 1MB memory on the Macintosh OSX platform with their
Java.</p>

<h3>Migration from LingPipe 3 to LingPipe 4</h3>

<p>LingPipe&nbsp;4.1.0 is <i>not</i>&nbsp; backward compatible
with LingPipe&nbsp;3.9.3.</p>

<p>Programs that compile in LingPipe&nbsp;3.9.3 without deprecation
warnings should compile and run in Lingpipe&nbsp;4.1.0.  
</p>


<h3>Downloading Last 3.9 Version: LingPipe 3.9.3</h3>

<p>The last 3.9 version of LingPipe before the major refactoring
is available at:</p>
<ul>
<li>
<a href="../lingpipe-3.9.3" rel="nofollow">Lingpipe&nbsp;3.9.3 Home</a>
</li>
</ul>


</div><!-- content -->

<div id="foot">
<p>
&#169; 2003&ndash;2011 &nbsp;
<a href="mailto:lingpipe@alias-i.com">alias-i</a>
</p>
</div>
<script type="text/javascript">
var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www.");
document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E"));
</script>
<script type="text/javascript">
try {
var pageTracker = _gat._getTracker("UA-15123726-1");
pageTracker._trackPageview();
} catch(err) {}</script></body>
</html>


